Tags: bloom filter*

0 bookmark(s) - Sort by: Date / Title ↑ /

  1. usersDF.write.format("orc")
    .option("orc.bloom.filter.columns", "favorite_color")
    .option("orc.dictionary.key.threshold", "1.0")
    .option("orc.column.encoding.direct", "name")
    .save("users_with_options.orc")
    Find full example code at "examples/src/main/scala/org/apache/spark/examples/sql/SQLDataSourceExample.scala" in the Spark repo
    2021-12-01 Tags: , , , , by klotz
  2. A standard technique from the hashing literature is to use two hash functions h1(x)
    and h2(x)to simulate additional hash functions of the form gi(x) = h1(x)+ih2(x). We demonstrate
    that this technique can be usefully applied to Bloom filters and related data structures. Specifically,
    only two hash functions are necessary to effectively implement a Bloom filter without any loss in
    the asymptotic false positive probability. This leads to less computation and potentially less need for
    randomness in practice.
    2014-02-05 Tags: by klotz
  3. Adam Kirsch,* Michael Mitzenmacher†
    2020-03-16 Tags: , by klotz

Top of the page

First / Previous / Next / Last / Page 2 of 0 SemanticScuttle - klotz.me: tagged with "bloom filter"

About - Propulsed by SemanticScuttle